Eur. Phys. J. C manuscript No. 

(will be inserted by the editor) 



Prospects for the Measurement of the Higgs Yukawa Couplings to b and 
c quarks, and muons at CLIC 

Christian Grefe a \ Tomas Lastovicka b 2 , Jan Strube c 1 

'CERN, Switzerland 

institute of Physics, Academy of Sciences of the Czech Republic, Prague, Czech Republic 



(N 

o 

< 



X 
i 



> 
O 
G\ 
00 
(N 

00 

o 

(N 



X 



Received: date / Accepted: date 

Abstract The investigation of the properties of the Higgs 
boson, especially a test of the predicted linear dependence of 
the branching ratios on the mass of the final state is going to 
be an integral part of the physics program at colliders at the 
energy frontier for the foreseeable future. The large Higgs 
boson production cross section at a 3 TeV CLIC machine al- 
lows for a precision measurement of the Higgs branching 
ratios. The cross section times branching ratio of the de- 
cays H —> bb, H — > cc and H — > u+uT of a Standard Model 
Higgs boson with a mass of 120 GeV can be measured with 
a statistical uncertainty of 0.23%, 3.1% and 15%, respec- 
tively, assuming an integrated luminosity of 2 ab _1 . 



1 Introduction 

The Higgs mechanism of the Standard Model predicts the 
existence of a fundamental spin-0 particle. Recently, the AT- 
LAS and CMS experiments at the LHC have observed a 
particle which is consistent with the predictions for a Stan- 
dard Model Higgs boson, but its properties remain to be 
studied [1, 2]. In particular, the Standard Model predicts a 
linear dependence between the Higgs branching ratios to 
fermions and their mass. This relation could be altered by 
the presence of new physics. The detailed exploration of 
the Higgs sector is thus instrumental to our understanding 
of the fundamental interactions. The compact linear collider 
(CLIC) is a proposed e + e~ collider with a maximum centre- 
of-momentum energy s/s = 3 TeV, based on a two-beam ac- 
celeration scheme [3]. The Higgs boson production cross 
section of 421 fb in the dominant W-fusion channel allows 
for precision measurements of the Yukawa couplings. The 
beam of the 3 TeV CLIC consists of bunch trains of 312 
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bunches, which are separated by 0.5 ns. The small beam size 
and large electric field in the bunches, required to achieve 
the peak luminosity of 5.9 x 10 34 cm _2 s _1 , lead to a large 
cross section of real and virtual two-photon processes that 
are a background to the processes of interest produced in the 
electron-positron collision. On average, 3.2 YY ~* hadrons 
events are produced at every bunch crossing at y/s =3 TeV. 

We present simulation studies of the measurements of 
the branching ratios H — > bb, H — > cc [4] and H — > [5] 
at such a machine. These studies of the Higgs branching ra- 
tios are part of the benchmarking analyses presented in the 
CLIC Conceptual Design Report [6]. They are carried out 
in a GEANT4-based simulation [7] of the CLIC_SiD [8] de- 
tector concept, with full account of Standard Model back- 
grounds and using a realistic reconstruction in presence of 
YY — > hadrons background. The latter is reduced partly by 
removing hits that are out of time with the physics process, 
partly by advanced off-line reconstruction techniques. 



2 The CLIC_SiD Detector Model 

The CLIC_SiD detector model in which these studies are 
carried out is a general-purpose detector with a 4 n coverage 
and is based on the SiD concept [9] developed for the ILC 
It has been adapted [8] to meet the specific detector require- 
ments at CLIC. It is designed for particle flow calorimetry 
using highly granular calorimeters. 

A superconducting solenoid with an inner radius of 2.7 m 
provides a central magnetic field of 5T. The calorimeters 
are placed inside the coil and consist of a 30 layer tungsten- 
silicon electromagnetic calorimeter with 3.5 x 3.5 mm 2 seg- 
mentation, followed by a tungsten-scintillator hadronic cal- 
orimeter with 75 layers in the barrel region and a steel- 
scintillator hadronic calorimeter with 60 layers in the end- 
caps. The read-out cell size in the hadronic calorimeters is 
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30 x 30 mm 2 . The iron return yoke outside of the coil is in- 
strumented with nine double-RPC layers with 30 x 30 mm 2 
read-out cells for muon identification. 

The silicon-only tracking system consists of five 20 x 
20 um 2 pixel layers followed by five strip layers with a pitch 
of 25 um, a read-out pitch of 50 um and a length of 92 mm 
in the barrel region. The tracking system in the endcap con- 
sists of four stereo-strip disks with similar pitch and a stereo 
angle of 12°, complemented by seven pixelated disks in the 
vertex and far-forward region at lower radii with pixel sizes 
of20x20um 2 . 

The forward region is instrumented with a LumiCal, with 
coverage down to 40 mrad, and a BeamCal, with coverage 
down to 10 mrad. 

The trigger-less readout integrates over 10 ns for all sub- 
detectors except the hadronic calorimeter, which has an in- 
tegration time of 100 ns to allow for shower development 
in the tungsten absorber. The silicon detectors allow time 
stamping of the recorded hits with a precision of a few ns. 



3 Analysis Framework 

The physical processes are produced with the Whizard [10, 
1 1] event generator, taking into account the CLIC beam spec- 
trum, with fragmentation and hadronisation handled by the 
PYTHIA [12] package. The branching ratios of a 120 GeV 
Standard Model Higgs boson are: BR(H ->• bb) = 6.48 x 
10 _1 , BR(H cc) = 3.27 x 10~ 2 and BR(H -> \i+[T) = 
2.44 x 10~ 4 [13]. The events are simulated in the CLIC_SiD 
detector model using SLIC [14], which is a thin wrapper 
around GEANT4. They are reconstructed by the org . lcsim 
and slicPandora packages. Unlike in analyses at lower- 
energy linear colliders, which use DURHAM-style jet find- 
ers that operate on all particles in the event, it was found that 
the beam-jets of algorithms originally developed for hadron 
colliders, lead to a crucial improvement of the jet-energy 
resolution and reduce the effect of the forward-peaking yy — > 
hadrons events greatly. In the analysis of Higgs decays to b 
and c quarks, we use the k t algorithm [15] as implemented 
by the FastJet [16, 17] package. The LCFI [18] package 
is used for flavour tagging. The assumed luminosity of the 
analyses is 2 ab~ 1 , corresponding to about 4 years of data 
taking at nominal conditions, assuming 200 days of running 
per year at an efficiency of 50%. 



3.1 Rejection of yy — » hadrons backgrounds 

A 3 TeV CLIC produces 3.2 yy — > hadrons events per bunch 
crossing on average. The spacing of 0.5 ns between bunches 
leads to pile-up in the subdetectors, which integrate over 
multiple bunch crossings. Identifying the time of the physics 
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Table 1: List of processes considered for this analysis with 
their respective cross section a and the number of simulated 
events Advents- The cross section takes into account the CLIC 
luminosity spectrum. Cross sections marked with * include 
a cut on the invariant mass of the muon pair to lie between 
100 and 140 GeV. 

event and reading out only a window of 10 ns for the sub- 
detectors, except for the barrel of the hadronic calorimeter, 
for which 100 ns are read out, reduces the the number of 
yy — > hadrons events in the data sample by about a factor of 
15. 

To take into account the effect of this background on 
the measurement, a sample of events from yy — > hadrons 
corresponding to 60 bunch crossings is mixed with each 
physics event for the analysis of the Higgs decaying to b and 
c quarks. In the H — > u+pT analysis, only the signal sam- 
ple was mixed with events from yy — s- hadrons background. 
These events are also simulated in the GEANT4 model of the 
CLIC_SiD detector. The equivalent of 60 bunch crossings 
is a compromise between realistic description and compu- 
tational constraints. The yy — > hadrons events are forward- 
peaking; they are described in more detail elsewhere [19]. 
Their contribution to hits in the barrel hadronic calorimeter, 
which, in principle, accumulates the equivalent of up to 200 
bunch crossings, is small. Table 1 lists the physics processes 
that were taken into account in the analyses. 

In addition to applying read-out windows off-line, the 
computation of the cluster time allows to further reduce this 
background. Assuming ns precision of the calorimeter hit 
times results in sub-ns precision for the cluster time, which 
is calculated as a truncated mean of the corresponding hit 
times. The production time of the reconstructed particle is 
obtained by correcting the cluster time for its time of flight 
through the magnetic field. The production time of the par- 
ticle is required to be consistent with the start of the physics 
event. Consistency is defined by a time window, whose size 
depends on the type of particle (hadronic or electromag- 
netic), its momentum and polar angle 9. This reduces the 
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(a) (b) 

Fig. 1 : Left: Efficiency of tagging a b-jet as c (blue lines) or 
light (green lines) versus tagging it as b. Right: Efficiency 
of tagging a c-jet as b (red lines) or light (green lines) ver- 
sus tagging it as c. The solid lines show performance with 
pile-up of yy — » hadrons events, the dashed lines without this 
background. 



(a) H -> bb (b) H — > cc 

Fig. 2: Statistical uncertainty of the measurement of cross 
section times branching ratio versus selection efficiency of 
the neural network. Left: The neural network was trained to 
identify H^bb decays from di-jet backgrounds including 
H — > cc. Right: The neural network was trained on H — >• cc 
as signal and di-jets including H — ► bb as background. 



energy from yy hadrons processes to the event further by 
a factor of 6 or more, while only about 0.5% of the energy 
from a typical physics event is removed [20]. 



4 Measurement of Higgs decays to pairs of b and c 
quarks 

The particles passing the pre-selection based on the recon- 
structed production time are clustered into two jets using the 
k t algorithm as implemented in the FastJet package. The 
LCFI flavour tagging package finds secondary vertices in 
each jet and uses them, along with complementary track- 
based information, in a neural network to distinguish b-, 
c-, and light quark jets. Figure 1(a) shows the mis-tag rate 
for c-jets and light jets as b-jets versus the b-tag efficiency, 
while Figure 1(b) shows the mis-tag rate for b-jets and light 
jets as c-jets versus the c-tag efficiency. The presence of 
YY — > hadrons background is found to reduce the flavour tag- 
ging performance. This effect is correlated with degraded jet 
finding quality due to the yy — > hadrons background. For in- 
stance, at the b-tag efficiency of 70% the mis-tag rate for c- 
jets (light jets) increases from 4.3% (0. 19%) without overlay 
to 6.8% (0.33%) with overlay. The main SM background of 
the measurement of the decays H — > bb and H — > cc is from 
two-jet processes e + e~ — » qqvv, due to their large cross sec- 
tion, and from processes with two measured jets and ad- 
ditional particles that escape detection. The invariant mass 
of the jet pair is the major discriminant between decays of 
Higgs and of Z bosons. It is used in a second neural net- 
work, together with the output of the b-flavour-tagging net- 
work and the following variables: 

- The maximum of the absolute values of jet pseudora- 
pidities. 



- The sum of the remaining LCFI jet flavour tag values, 
i.e. c-tag against u d s b-background, c-tag against b- 
background and b-tag against uds-background. 

- Rnd, the distance of jets in the 77 — plane. 

- The sum of jet energies. 

- The total number of leptons in an event. 

- The total number of photons in an event. 

- The acoplanarity of the jets. 

The neural network selection efficiency S/S tota i versus the 
statistical uncertainty y/S + B/S on the measurement of the 
number of signal events S and background events B is shown 
in Figure 2 for the two neural networks that were trained on 
H — » bb and H — > cc as signal, respectively. The optimal se- 
lection is at the local minimum of the curve, at a selection 
efficiency of 55% for H — > bb with a sample purity of 65%, 
corresponding to a statistical uncertainty of 0.23%. The op- 
timal selection for H —> cc has an efficiency of 15%, cor- 
responding to a sample purity of 24% and a statistical un- 
certainty of 3.1%. Purity values reflect the fact that b-jets 
can be distinguished from c-jets with high purity, while in- 
completely reconstructed b-jets make up a large fraction of 
the background to c-jet selection, making the analysis more 
challenging. 

5 Measurement of Higgs decays to pairs of muons 

The measurement of the rare decay H — > (i + ur requires high 
luminosity operation and sets stringent limits on the momen- 
tum resolution of the tracking detectors. The branching ratio 
of the decay of a Standard Model Higgs boson to a pair of 
muons is important as the lower end of the accessible de- 
cays and defines the endpoint of the test of the predicted 
linear dependence of the branching ratios to the mass of the 
final state particles. 
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5.1 Event Selection 

The average muon reconstruction efficiency for polar an- 
gles greater than 10° is 99.6% without YY ~> hadrons back- 
ground. When adding this background the muon reconstruc- 
tion efficiency deteriorates to 98.4% in this region of polar 
angles. The efficiency for smaller polar angles is limited by 
the acceptance of the tracking detectors. The events are re- 
quired to have at least two reconstructed muons, each with a 
transverse momentum of more than 5 GeV. In case there are 
more than two muons reconstructed, the two most energetic 
ones are used, which are referred to as u^ and \i2- In addi- 
tion, the invariant mass of the two muons M(\i\i) is required 
to be between 105 GeV and 135 GeV. The total reconstruc- 
tion efficiency of the signal sample is 72% in the presence 
of YY ~ > hadrons background. The inefficiency is dominated 
by acceptance effects. 

The event selection is done using the boosted decision 
tree (BDT) classifier implemented in TMVA [21]. The u+uT, 
t + t~ and t + iTvv samples are not used in the training of the 
BDT, but are effectively removed by the classifier neverthe- 
less. The variables used for the event selection by the BDT 
are: 

- The visible energy excluding the two reconstructed muons 
F ■ 

- The scalar sum of the transverse momenta of the two 
muons PT(m)+/?T( u -2)' 

- The helicity angle cos0*(u|i) = | p ^/)|>(^V where P ' 
is the momentum in the rest frame of the di-muon sys- 
tem. 

- The relativistic velocity of the di-muon system P(u.u,), 
where P = ^ ■ 

- The transverse momentum of the di-muon system pj(\i\i). 

- The polar angle of the di-muon system 0(u,u,). 

The most powerful variable to distinguish signal from back- 
ground events is the visible energy whenever there is an 
electron within the detector acceptance. Otherwise the back- 
ground can be reduced by the transverse momentum of the 
di-muon system or the sum of the two individual transverse 
momenta. Figure 3 clearly shows the Higgs peak in the in- 
variant mass distribution after the event selection. 

5.2 Invariant Mass Fit 

The number of signal events is determined by an unbinned 
maximum likelihood fit of the invariant mass distribution 
of the combined signal and background sample. This sam- 
ple is randomly selected from all simulated events, accord- 
ing to the assumed integrated luminosity of 2 ab~ ! . The ex- 
pected shapes of the signal and background contributions 
are determined from a fit to the full statistics of the respec- 
tive sample. The distribution of the invariant mass in the 
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Fig. 3: Maximum Likelihood fit of the Higgs mass in the 
data sample after selection cuts 
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Table 2: Dependence of the statistical uncertainty of the 
measurement of cross section times branching ratio for the 
decay h — > u, + uT on the momentum resolution a(Apj)/ p\. 
The study assumes an integrated luminosity of 2 ab _1 . The 
values do not include the impact of the YY ~* hadrons back- 
ground and the possible reduction of the e + e~ — > u. + uTe + e~ 
background using electron tagging in the forward calorime- 
ters. 

e + e~ — > H — > u+uT sample has a tail towards lower masses 
because of final state radiation. It is described by two half 
Gaussian distributions, each with an exponential tail. The 
background is well described by an exponential parametri- 
sation, obtained from a background-only sample. 

The BDT selection with the highest signal significance 
yields a total signal selection efficiency of 21.7%, corre- 
sponding to about 53 selected events in 2 ab _1 . The relative 
statistical uncertainty on the cross-section times branching 
ratio obtained from the fit of the invariant mass distribu- 
tion is 26.3%. This corresponds to a signal significance of 
approximately 3.8(7. Without addition of the YY — ► hadrons 
background the relative statistical uncertainty on the cross- 
section times branching ratio improves to 23%, due to higher 
signal selection efficiency. 



5.3 Study of the Momentum Resolution 

The ability to measure the decay H — > u+uT depends cru- 
cially on the momentum resolution of the tracking detec- 
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tors. In a fast simulation study, different values for the mo- 
mentum resolution were applied to the true muon momenta. 
For each assumed momentum resolution an individual BDT 
was trained to optimise the event selection for the invariant 
mass fit, which is performed as described above. For this 
study the impact of the yy — > hadrons background was ne- 
glected. The results are shown in Table 2. We find an aver- 
age resolution of at least 5 x 10 _5 GeV _1 is required in or- 
der for the momentum resolution not to be the dominant un- 
certainty contribution in a 2 ab _1 measurement of the decay 
H — > n + u~. The average momentum resolution in the fully 
simulatedH — > u+uT sample is4x 10 _5 GeV _1 . The results 
from the fast simulation study are thus consistent with those 
found Section 5.2. 



5.4 Forward Electron Tagging 

The dominant contributions to the reducible background are 
from Z pair production, where one Z decays to a pair of 
muons and the other decays invisibly, and from the t-channel 
diagram contributing to e + e~ — » u+u~e + e~. In the latter 
the electron-positron pair goes in the very forward direc- 
tion. We have investigated a possible reduction of this back- 
ground using the forward calorimeters LumiCal and Beam- 
Cal. The distributions of energy and angle with the outgoing 
beam axis of the most and second most energetic electrons in 
e + e~ — !• u+u~e + e~ events are shown in Figure 4. Although 
most electrons are produced at very low polar angle, a large 
fraction of the electrons are within the fiducial volumes of 
the LumiCal and BeamCal, which have an acceptance of 
44 mrad and 15 mrad, respectively, with respect to the out- 
going beam axis. Since the forward calorimeters were not 
part of the full detector simulation, we assume ad-hoc elec- 
tron tagging efficiencies in these two calorimeters to reject 
background events. Afterwards, a dedicated BDT classifier 
is trained on the pre-selected background samples using the 
variables described in Section 5.1. For example, assuming 
an electron tagging efficiency of 95% in the LumiCal im- 
proves the total signal selection efficiency to 49.7%, which 
results in a statistical uncertainty on the cross-section times 
branching ratio measurement of 15.7%. Assuming a higher 
electron tagging efficiency of 99% in the LumiCal improves 
this result to approximately 15%. If the BeamCal is used in 
addition, assuming an average electron tagging efficiency of 
70% in its fiducial volume, the statistical uncertainty can be 
improved to 14.5%. 

An independent study [22] of the electron tagging effi- 
ciency in the forward calorimeters at a CLIC detector, taking 
into account the yy — > hadrons background as well as e + e~- 
pair background, confirms the efficiencies we assume here. 
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Fig. 4: Kinematic distributions of the most and the second 
most energetic electron in u. + uTe + e~ events. Left: Distribu- 
tion of the simulated electron energy. Right: Distribution of 
the polar angle with respect to the outgoing beam axis. 

6 Results 

We have demonstrated the potential of measuring the cross 
section times branching ratios of a 120 GeV Higgs boson at 
a 3 TeV CLIC with high precision. For the measurement of 
Higgs decays to quarks, 0.23% and 3.1% statistical uncer- 
tainty can be achieved for the decays H — > bb and H — » cc, 
respectively. This includes the effect of background from 
yy — s- hadrons on the flavour tagging. Given the experience 
of the LEP experiments [23] in the measurements of hadronic 
Z decays, with systematic uncertainties between 0.3% - 1.2% 
for /tf and between 1.2% and 10% for one can assume 
that a systematic uncertainty of around 1 % is achievable in 
H^bb and around 5% inH^ cc. 

For the rare decay H — > u+uT, the cross section times 
branching ratio can be measured to a precision of about 15% 
if the background from e + e~ — » u+u^e+e - can be reduced 
using tagging of electrons in the LumiCal with an efficiency 
of 95%, and the average momentum resolution is not worse 
than 5 x 10~ 5 . The effect of background from yy — > hadrons 
has been taken into account. From the measurements of the 
branching ratio of Z decays to a pair of muons at the LEP 
experiments, with systematic uncertainties between 0. 1 and 
0.4%, depending on the experiment, one can assume that the 
systematic uncertainties related to detector effects are of the 
order of 1% or less. The expected uncertainty of the peak 
luminosity is currently being studied but is estimated to be 
around 1% or less. 



6. 1 Extracting the Higgs Coupling Constants 

The uncertainties on the measurements of cross section times 
branching fraction can be translated to an uncertainty on the 
coupling constants. A global fit [24] to the complete set of 
measured electroweak observables gives the most accurate 
picture of the nature of the coupling constants. In absence 
of the full set of measurements, we estimate the achievable 
precision on the Higgs couplings in the measured channels 
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by assuming that deviations from the Standard Model pa- 
rameters occur only in the channel under consideration [25]. 
Using a recent overview of the uncertainties of the Standard 
Model Higgs branching ratios [13], Table 3 summarises con- 
servative estimates on the achievable sensitivity to Standard 
Model Higgs coupling constants. For the hadronic decays, 
even the combination of the statistical uncertainty and a con- 
servative average of the systematic uncertainties from simi- 
lar measurements at LEP, as discussed above, is dominated 
by the current theoretical uncertainties of 2.8% for H — ► bb 
and 12.2% for H — > cc. In the case of H — > u. + (JT the sta- 
tistical uncertainties will dominate both the systematic un- 
certainties and the current theoretical uncertainty of 6.4%. 



Table 3: Statistical uncertainties of the measurements of the 
cross section times branching fraction, and the sensitivity to 
the SM b, c and jJ, Yukawa Higgs coupling constants at a 
3 TeV CLIC with an integrated luminosity of 2 ab~' 
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